Attention-guided chained context aggregation for semantic segmentation

نویسندگان

چکیده

The way features propagate in Fully Convolutional Networks is of momentous importance to capture multi-scale contexts for obtaining precise segmentation masks. This paper proposes a novel series-parallel hybrid paradigm called the Chained Context Aggregation Module (CAM) enrich feature representation. CAM gains various spatial scales through chain-connected ladder-style information flows and fuses them two-stage process, namely pre-fusion re-fusion. serial flow continuously increases receptive fields output neurons those parallel encode different region-based contexts. Each shallow encoder-decoder with appropriate down-sampling sufficiently contextual information. We further adopt an attention model guide Based on these developments, we construct Network (CANet), which employs asymmetric decoder recover details prediction maps. conduct extensive experiments six challenging datasets, including Pascal VOC 2012, Context, Cityscapes, CamVid, SUN-RGBD GATECH. Results evidence that CANet achieves state-of-the-art or competitive performance.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Context Encoding for Semantic Segmentation

Recent work has made significant progress in improving spatial resolution for pixelwise labeling with Fully Convolutional Network (FCN) framework by employing Dilated/Atrous convolution, utilizing multi-scale features and refining boundaries. In this paper, we explore the impact of global contextual information in semantic segmentation by introducing the Context Encoding Module, which captures ...

متن کامل

Object Boundary Guided Semantic Segmentation

Semantic segmentation has been a major topic in computer vision, and has played an important role in understanding object classes as well as object localizations. Recent development in deep learning, especially in fully-convolutional neural network, has enabled pixel-level labeling for more accurate results. However most of the previous works, including FCN, did not take object boundary into co...

متن کامل

Semantic Segmentation with Reverse Attention

Recent development in fully convolutional neural network enables efficient end-to-end learning of semantic segmentation. Traditionally, the convolutional classifiers are taught to learn the representative semantic features of labeled semantic objects. In this work, we propose a reverse attention network (RAN) architecture that trains the network to capture the opposite concept (i.e., what are n...

متن کامل

Context Tricks for Cheap Semantic Segmentation

Accurate semantic labeling of image pixels is difficult because intra-class variability is often greater than inter-class variability. In turn, fast semantic segmentation is hard because accurate models are usually too complicated to also run quickly at test-time. Our experience with building and running semantic segmentation systems has also shown a reasonably obvious bottleneck on model compl...

متن کامل

Mixed context networks for semantic segmentation

Semantic segmentation is challenging as it requires both object-level information and pixel-level accuracy. Recently, FCN-based systems gained great improvement in this area. Unlike classification networks, combining features of different layers plays an important role in these dense prediction models, as these features contains information of different levels. A number of models have been prop...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Image and Vision Computing

سال: 2021

ISSN: ['0262-8856', '1872-8138']

DOI: https://doi.org/10.1016/j.imavis.2021.104309